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METHOD, SYSTEM AND COMPUTER PROGRAM PRODUCT FOR EVALUATING DOWNLOAD PERFORMANCE 
OF WEB PAGES 

5 Field of the invention 

The present invention relates to techniques for 
evaluating download performance of web pages, such as 
times involved in downloading web pages . 

The invention was developed by paying specific 
10 attention to the possible application to mobile 
telecommunications networks such as GPRS (General 
Packet Radio Service) and UMTS (Universal Mobile 
Telecommunications System) networks. Reference to this 
preferred field of application is not to be construed 
15 as intended to limit the scope of applicability of the 
invention. 

■ Description of the related art 

Download times of web pages in networks such as 
GPRS and UMTS networks are affected by a number of 

2 0 factors. In addition to network performance, the 
characteristics of each web page, such as the number 
and the dimensions of the objects comprised on the 
page, and the browser type used for downloading are 
other factors that come into play in determining 

2 5 download performance of web pages. 

A number of techniques are already available with 
Internet service providers (ISPs) or content providers 
(CPs) in order to verify in a non- intrusive way 
(essentially by way of simulation) the expected 

30 performance of services offered to clients. 

For instance, the NetForecast Report 5055 entitled 
"Understanding Web Performance" by T. Sevcik and J. 
Bartlett, which is an expanded version of an article 
with the same title first published in Business 

35 Communications Review, October 2001, includes a review 
of commercial products adapted for determining download 
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times of certain web pages. A definition is also 
provided of certain parameters that can be adapted to 
those commercial tools in such a way to permit 
simulation of download times. 
5 A basic disadvantage of such prior art techniques 

lies in that they not take into account the role played 
by certain variables such as: 

- the type of browser used for downloading, 

- the types of web pages considered, 

10 _ the specific performance level of certain 

network services being used, for instance in terms of 

available bit rate and/or latency. 

In fact, these variables are essential in 

determining the response time of a network such as a 
15 GPRS or UMTS network to the request for a certain page 

to be downloaded, indicated throughout this document as 

web pages . 

Furthermore, such prior art techniques fail to 
take into account the relationship existing between 
20 notional channel capacity available in terms of bit/s 
and the payload capacity actually available. 
Object and summary of invention 

The object of the present invention is thus to 
provide a technique for predicting download times that 
25 may lead to accurate results and that also lends itself 
to be adapted to the specific characteristics of the 
services provided by a determined service and/or 
contents provider. 

According to the present invention, such an object 
3 0 is achieved by means of a method having the features 
set forth in the claims that follow. The invention also 
relates to a corresponding system as well as to a 
computer program product directly loadable in the 
memory of a computer and including software code 
35 portions for performing the method of the invention 
when the product is run on a computer. 
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A preferred embodiment of the invention evaluates 
download performance parameters of web pages accessible 
via a network by providing at least one model for 
predicting a set of download performance parameters for 
said web pages as a function of a respective set of 
input parameters. The at least one model includes at 
least one optimisation parameter (X) . 

Such a model may typically comprise a module for 
evaluating the sum of: 

at least one first factor determined 
analytically on the basis of network and web page 
parameters, and 

- a second factor being a function, preferably of 
the hyperbolic type, of an optimisation parameter. 
15 A set of sample web pages is defined and said set 

of download performance parameters for the sample web 
pages are both measured and evaluated on the basis of 
the model for different values of the at least one 
optimisation parameter. An error indicative of the 
20 difference between the download performance parameters 
for the sample web pages as measured and as evaluated 
on the basis of said model, respectively, is defined 
and an optimised model is selected including a value of 
the least one optimisation parameter minimising the 
25 error or reducing it below a predetermined value. 
Download performance parameters for any selected set of 
pages accessible through the network (N) can then be 
evaluated without interfering with operation of the 
network on the basis of the optimised model. This is 
30 done (in a non-intrusive manner, i.e. without 
interfering with operation of the network) by way of 
prediction on the basis of the selected model. 

Preferably, the set of download performance 
parameters includes at least one parameter selected 
35 from the group consisting of download time for a given 
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web page and an efficiency index indicative of how said 
given web page exploits the capacity of the network. 

Still preferably, the prediction model is based on 
at least one parameter selected out of the group 
5 consisting of the throughput of the network, the round 
trip time (RTT) of the network, and at least one of the 
type and dimension of each object included in the web 
pages considered. 

In a particularly preferred embodiment of the 
10 invention, the model corresponds to the relationship: 

where t is the total download time of the page, n 
is the number of objects therein, d is the average size 
of these objects, b is the throughput of the downstream 
15 link (downlink) , h is the dimension of the HTTP 
headers, 1 is the network RTT and X is a free parameter 
to be optimised, namely the parameter whole value 
identifies the "optimum" model used for evaluating 
download performance prediction within a plurality of 
2 0 available models corresponding to the general 
relationship reproduced above. 

Specifically, with the technique described herein, 
the response times to be expected ^during downloading 
can be accurately simulated for each service provider 
2 5 or contents provider without interfering with operation 
of the network. Additionally, an efficiency index can 
be defined representative of the amount each web page 
effectively exploits the capacity of the respective 
network . 

30 The solution described herein gives rise to an 

architecture and an arrangement that permit both the 
download times and the efficiency index related to a 
certain web page to be predicted starting exclusively 
from the number and dimensions of the objects comprised 

3 5 on the web page in question. 
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The main advantage of such an architecture lies in 
that it permits the download times and the efficiency 
index to be evaluated (i.e. estimated) for a large 
number of pages based on an optimised model identified 
5 via measurements carried out on a relatively small set 
of sample pages. _ 
An extensive database can thus be rapidly created 
which is adapted for generating statistics related to 
the typical surfing speed as perceived by the user of a 
10 network such as GPRS/UMTS networks. 

in the presently preferred embodiment, the 
architecture in question includes essentially two 
categories or groups of elements, namely: 

- those elements adapted for carrying out -in the 
15 field" measurements on the network (GPRS and/or UMTS 

for instance) with reference to a set of sample pages, 
thereby permitting identification of a corresponding 
optimum model, and 

- those elements that exploit the model thus 
20 identified for evaluating purposes i.e. for generating 

predictions . 

Brief description of the enclosed dr awings . 
The invention will now be described, by way of 
example only, by referring to the enclosed figures of 
25 drawing, wherein 

- figure 1 is a block diagram of architecture for 
determining model parameters related to downloading web 
pages in a network such as a GPRS or UMTS, 

- figure 2 is a block diagram of architecture for 
30 predicting download times, 

- figure 3 is a flow-chart representing in-the- 
field measurements and calculation of model parameters, 
and 

- figure 4 is flow-chart representing the process 
35 of estimating download parameters. 
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D etailed descripti on of a preferred embodiment_of_ 

the invention. 

in the diagram of figure 1, reference I generally 
denotes a wide area network such as the Internet, while 
5 reference N represents a network, such as a mobile 
telecommunications network, adapted for providing 
access to the network I. Exemplary of the network N 
are for instance, a GPRS or a UMTS network. 

Reference 10 denotes a mobile terminal such as a 
10 mobile GPRS /UMTS terminal used primarily as means for 
conveying data (that is essentially as a modem) . 

Reference 12 is a processing unit such as a 
computer configured for in the field measurements. The 
processing unit 12 is typically a personal computer 
15 (PC) such as a "laptop" portable computer adapted to be 
connected to the mobile terminal 10 to access the 
Internet I via the network N. 

The unit 12 is configured (in a manner known per 
se) in order to perform a set of measurements 

2 0 including: 

- throughput measurement, 

- RTT (round trip time) , 

- download times of selected web pages. 
Reference 14 denotes a server" terminal facility 

25 comprised by one or more servers adapted to be accessed 
via the internet and containing reference files used 
for carrying out measurements of the network 
parameters. Those files may be comprised e.g. of HTML 
pages with given formats and sizes. 
30 Connection of the reference server (s) to the 

network N and to the databases associated therewith (to 
be described in greater detail in the following) takes 
place via respective routers designated Rl and R2 . 

The throughput measurement tool provided in the 
35 computer 12 is adapted to measure throughput by 
downloading corresponding files from the reference 
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server: typically, this may be a HTTP client 
downloading a single HTML file. The results are stored 
by writing them as database items in a measurement 

database 16. . 
5 Similarly, the RTT measurement tool installed xn 

the computer 12 carries out RTT measurements towards 
the reference server (s) 14. Typically, RTT is measured 
by using a method similar to the method used by the 
PING command in operating systems (OS) . The results are 
0 again stored as items in the database 16. 

Reference 18 denotes another database including 
'items comprising a list of sample web pages. This is 
essentially a database including a list of a relative 
small set of web pages intended to be used for 
5 selecting an optimised model to be subsequently used 
for evaluation (i.e. estimation or prediction) purposes 
with reference to a generally much broader set of web 
pages. 

The set of sample pages is chosen in such a way 
2 0 that the sample pages represent in a statistically 
meaningful manner the types of pages for which download 
performance is to be predicted. For instance, the . 
sample pages in question can be selected as the 
homepages of 100 most frequently accessed web sites xn 
25 a certain area. 

As already indicated, the measurement database 16 
includes the results of those measurements carried out 
on the network N (essentially throughput and RTT) for 
each web page in the set of sample pages. 
3 0 For each sample page subject to measurement, the 

following items are usually collected and stored: 

- the page URL, 

- the size of each object therein, 

- the start time and the end time of downloading 
35 each object, 

- the total download time, 
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- the throughput and the RTT of the network at the 
terminal during the time interval where the measurement 
was carried out (the time interval is chosen 
judiciously in such a way that no appreciable 
variations take place in the network parameters while 
measurements are being carried out) , 

- type and version of the browser used. 
After being populated, the database 16 is used for 
calibrating the free parameter (s) in the evaluation 
(i.e. estimation or prediction) model. 

As indicated, such a model may typically comprise 
the sum of : 

. at least one first factor determined 
analytically on the basis of network and web page 
5 parameters, and 

- a second factor being a function, preferably of 
the hyperbolic type, of an optimisation parameter. 

Such a model is typically represented by a 
relationship of the type: • 

where t is the total download time of the page, n 
is the number of objects therein, d is the average size 
of these objects, b is the throughput of the downstream 
link (downlink), h is the dimension of the HTTP 
>5 headers, 1 is the network RTT. 

For a given set of values for n, d, b, h, and 1 
the relationship in question does in fact represent a 
class or set of models, the various models in the set 
being characterized by a respective value of the 

3 0 parameter X. 

Calibrating the free parameter (s) in the 
evaluation model on the basis of the sample web pages 
essentially requires identifying a value for the 
parameter X that corresponds to an -optimum" model, 

35 i e a model best matching the input -to-output 
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relationships that are actually measured in respect of 
the sample web pages. 

Those of skill in the art will promptly appreciate 

that: . 
5 - the models out of which the "optimum" model is 

selected (based on the measurements carried out on the 
sample web pages) may in fact correspond to a plurality 
of different relationships, including heuristic models, 
and 

1Q _ the "free" parameters involved xn the 

optimisation process may be any number, and not just 
one (i.e. X) as in the exemplified case. 

Obviously, increasing the number of parameters 
involved in the optimisation process, will lead to a 
15 more complex, resource- and time-consuming optimisation 
process . 

The experiments carried out by the Applicants have 
however shown that, at least insofar as existing GPRS 
networks are concerned, the simple relationship (I) 
20 reported in the foregoing and involving only one "free- 
parameter (namely X) leads to quite satisfactory 
results . 

in general, different types of models are used for 
evaluating download times and efficiency indexes for 
25 different types of network N. 

The model to be actually used for a specific case 
will be selected depending on the type of network 
considered. 

in fact, each model includes approximations that 
30 apply only for certain network types. Consequently, it 
is necessary to measure certain network parameters 
(essentially the available bandwidth and the RTT) and 
then select on the basis of pre -determined thresholds, 
the model best suited for determining the download 
3 5 times of HTTP pages on such a network. 



WO 2004/097645 



PCT/EP2003/004522 



The *a S «r e «t tool for the download time 
provided in the processing unit 12 measures the time 
P n Ieded for downloading a given wen page by -producing 
the behaviour of certain predefined type of web 
5 browser, for instance Internet Explorer . 

as its input, the tool in question accepts a list 
of web pages to be downloaded. As its output, for each 
web page, the following data are provided: 

- total download time, 

0 - dimension of each object downloaded, 

- start and end times of downloading each object 

downloaded. , 

The results of measurements are stored in the 

database 16. pre£erre d embodiment, a *P^° 

" t00 l (currently available with the applicant as BMPOP 
is used for downloading pages and deriving the 
respective download times in co-operation with a 
"Xffer" for obtaining the dimensions and the download 
20 start and end times for each object. 

in the diagrams of figure 1 and 2, 
designates the database comprising data base items that 
define the model to be optimised for predicting the 
download time of a given web page. 
25 As its input, the database 20 accepts data such as 

network throughput and RTT, the dimensions of each 
object included in the web page and one or more free 
parameters defined experimentally. As its output the 
database 20 provides the download time for a given page 
30 and/or its efficiency factor, as ^J^^ to 
For instance, this may occur (with reference to 
existing GPRS networks) on the basis of the 
relationship (I) considered in the foregoing, where: 

- t is the total download time of the page (i.e. 
35 the output of the model) , and 
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- n is the nuniber of objects in the page, d is the 
average size of its objects, b is the throughput 
available in the downstream link, h is the dimension of 
the HTTP headers, 1 is the network RTT (i.e. the set of 

5 input data to the model) , and 

- X is a factor (parameter) to be established 
experimentally to identify the "optimum" model to be 
used for evaluation purposes. 

Reference 22 denotes a further database (that m 
10 fact may be incorporated with the database 20) 
including the optimum parameters for the models 
computed for each combination of network parameters and 

browser type . , 

in fact, the arrangement shown herein lends itself 
15 to be operated in such a way that the optimum 
parameter^) - e.g. X - are determined for a given 
model type and for a given network type by measuring 
the download times of the set of sample pages and then 
obtaining the best value for the parameter (s) , that are 
20 stored in the database 22. Then the optimum 
parameter(s) - e.g. X - are determined for one or more 
network types in respect of given models (identical or 
different from the one considered), by measuring the 
download times of the set of sample pages and then 
25 obtaining the best value for the parameter (s) in the 
database. 22. The database 22 is thus populated with 
different optimum values to be used for evaluating 
tasks related to different types of networks. 

Optimisation of each model for a given type of 
30 network (e.g. finding the value of X that identifies 
the optimum model in the set expressed by the 
mathematical relationship (I) referred to in the 
foregoing) is performed by an optimiser module 24. 
input data to the module 24 are preferably: 
35 - the type of model to be used (e.g. the 

relationship (I) repeatedly cited in the foregoing) , 
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- throughput and RTT of the network considered 
(e.g, "b" and "1" in the relationship (I)), 

- list of the web pages, 

_ for each page in the list: the start and end 
5 downloading times (whose . difference is the parameter 
«t« in the relationship (I) ) and the dimensions of .all 
the objects comprising such a page (e.g. V, "h" and 
M" in the relationship (I) ) • 

The output of the module 24 is comprised of the 
10 optimum value (s) for the free parameter (s) of the model 
being used ((e.g. "X" in the relationship (I)). 

Specifically, the module 24' operates by allotting 
a given value to the or each free parameter in the 
model (e.g. "X") and then computing the download times 
15 of the pages in the sample set on the basis of such 

value . , . _ 

A comparison is then made with the corresponding 
download times as measured experimentally and a 
-global" error is then computed as a function of the 
20 "partial" errors for each page. 

This may be done by resorting to statistical 
criteria such as e.g. the root means square (RMS) error 
or the peak value of the signal-to-noise ratio (PSNR) 

The value of the free parameter (s) is then varied 
25 searching for the minimum of the global error. 

This result is preferably achieved in a numerical 
manner, e.g. by means of a standard numerical method 
(e g. steepest descent) aiming at minimising the global 
error or reducing it below a predetermined value. 
30 Preferably, the server (s) 14 as well as the 

databases 16, 18, 20, 22 and the module 24 are jointly 
configured in the form of a local area network (LAN) . 

The databases 20 and 22 are intended to co-operate 
with additional databases and other modules xn 
35 evaluating the download performance for a given set of 
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web pages on the basis of the optimum model identified 

in the foregoing. a . h „ 
in figure 2. reference 26 denotes still another 
database including the statistical characteristics of a 
5 list of web pages for which download performance rs to 
be evaluated . 

specifically, for each web page the following 

items are stored: 

- URL, 

10 - list of the objects comprised in the page, 

- dimension of each object. 

This database is populated by means of a web site 
analyser 28 and is subsequently used for determinxng 
the download times of the pages contained therexn 
15 The web site analyser 28 is another module adapted 

to derive the characteristics of the web page to be 
used as the input for a download performance predictor 

30 . .j 
The input to the web site analyser 28 is composed 

20 of a list of web pages to be analysed. 

The output from the analyser 28 is comprxsed, for 
each page in the input list, of the following items: 

- the list of the object comprising the page, and 

- the dimensions of each object. 

25 Such an output is stored in the database 26. 

Typically, the web site analyser 28 is operated on 
a fast network, thus making it possible to collect 
information concerning a large number of web pages xn a 

short time. , 

30 The predictor 30 is comprised of a module adapted 

for calculating the download time and the efficiency 

index for a given web page without actually performing 

any measurement. 

The predictor 3 0 is essentially a software module 
35 adapted to receive as its input data such as the 

network characteristics, the browser type used and the 
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characteristics of the web page while providing as its 
output the download time and the efficiency index 
evaluated for that page. 

in a preferred embodiment, the predictor 30 
accepts as its input the following items: 

- throughput and RTT of the network considered; 

- model and free parameter (s) of the model i.e. 
the "optimum" model to be used for evaluating the 
download performance by way of prediction, and 

- number and dimensions of the object comprised xn 

the page . . . 

- output of the predictor 30 is comprxsed 



The 



20 



j. lie — — 

essentially of the predicted download time for the page 
and its efficiency index. 

Data pertaining to the characteristics of the page 
are read from the web page statistics database 26, the 
parameters to be used are read from the optimised 
parameter database 22 and the results are wrxtten xnto 
a prediction database 32. 

The efficiency index referred to in the foregoxng 
is preferably determined by resorting to a two-step 
procedure. ^ aver age throughput of each 

web page is computed by dividing tie total number, of 
25 bytes therein by the download time. 

Subsequently, the efficiency index is computed as 
the ratio of the web page throughput to the network 
throughput (as measured previously) . 

in the notional absence of protocol overhead, such 
30 an efficiency index would be equal to one. 

The database 32 includes the download txmes and 
the efficiency indexes evaluated for the web pages 
included in the list of the web pages to be analysed by 
means of the "optimum" model defined previously. 
35 The database 32 is populated by the predictor 

module 30 and it includes the download time and the 
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efficiency index as evaluated for each web page (and 
for each network type) , on the basis of the optimised 
model. Preferably, for each web page the followxng 
items are recorded in the database 32: 
5 - download time, 

- estimated efficiency index, 

- network parameters (throughput and RTT) , 

- model, 

- free parameter (s) of the model as used for the 

10 prediction. _ 

Consequently, the database 32 will contaxn (for 
each of the web pages) the expected download time and 
the efficiency index on a given type of network. 

Again, the various blocks shown in figure 2 are 
15 preferably configured in order to co-operate in the 
form of a LAN . In actual fact, the arrangements shown 
in figures 1 and 2 can be regarded as corresponding to 
the same LAN being subsequently re-configured to 
perform two basic processing phases. 
20 These two phases essentially correspond to the 

sequence of steps in the flow-charts of figure 3 and 
figure 4, respectively. 

In the first phase, measurements are earned out 
on the network N in order to determine the 
25 characteristics thereof, while the download times for 
the sample web pages are measured. The results of such 
measurements are used for selecting a preferred 
( "optimum") model and for setting the free parameters 
(e g X) of such a model. In the second phase, the 
30 model performs prediction by using those parameters. 

in the flow-chart of figure 3, starting from a 
step 100, in a step 102 the sample web pages are 
selected and the respective sample files to be used for 
carrying out the network measurements are loaded into 
35 the reference server (s) 14. These files may be 
comprised, for instance, of files of different 
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dimensions to be downloaded via HTTP . The information 
pertaining to the sample web pages thus selected xs 
stored in the database 18. 

in a step 104, throughput and RTT are measured for 
i the network by accessing the reference server (s) 14. 
These measurements are performed by using the tools 
available in the computer 12 and the respective results 
are written in the measurement database 16. 

It will be appreciated that, while represented in 
0 a sequential fashion, the steps 102 and 104 (as well as 
most of the ensuing steps deriving therefrom) can be 
carried out at least partly simultaneously. 

As indicated, the set of sample web pages stored 
in the database 18 is selected as a statistically 
5 meaningful set comprising, for instance, the home pages 
of the most frequently accessed websites in a certaxn 
area. Essentially, this choice is carried out in such a 
way that the selected sample web pages will not be 
appreciably different from the web pages to which the 
20 expected predictions will apply. The statistical 
analysis of the sample web pages is carried out xn a 
step designated 106 and the list of the sample web 
pages is given as input to the measurement tool for the 
download times of the web pages. 
25 This tool performs in a step 108 the respective 

measurements of the download times for the sample web 
pages. The page statistics (for instance dimensions and 
number of objects included therein) and the results are 
stored in the measurement database 16. 
30 In step 110 a candidate model (such as, for 

instance, the relationship I considered in the 
foregoing) is selected from the model database 20. 

This is preferably done depending on a certaxn 
predetermined threshold concerning the network 
35 parameters. For instance, if the measurements have been 
performed on a low throughput network, a model is 
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selected where the server processing times are 
neglected. Conversely, if a high throughput network is 
being considered, a model taking into account also 
those processing times will have to be chosen. 
5 in step 112 the optimiser module 24 is activated 

in order to process the data comprised of the 
m easurement results and the type of model chosen. The 
purpose of optimisation is to derive one or more 
optimum parameters (i.e. the optimum model) for the set 
10 of sample pages chosen to be memorized in the parameter 
database 22. The optimum parameter (s) thus obtained can 
be subsequently used for predicting, via the module 30, 
the download times for each page on the same type of 
network. 

15 For instance, assuming the model is represented by 

the relationship (I) referred to in the foregoing, the 
purpose of optimisation is to identify an optimum value 
of the parameter X that minimises or reduces below a 
predetermined value the difference (error) between the 
20 download times for the sample pages as actually 
measured and as evaluated by way of prediction using 
the model, respectively. 

Such an optimisation process may be repeated for 
different types of networks for which download 
25 performance is intended to be evaluated, so that 
optimum models can be obtained and stored for different 
types of network to be subsequently analysed. 

The download times predicted are compared to the 
corresponding download times as actually measured for 
30 those pages to define a global error associated with 
the model /parameters under test. 

As indicated, the global error is defined as an 
entity (e.g. MSB, PSNR) indicative of the difference 
between the predicted values and the values measured 
3 5 over the whole set of the sample web pages. 
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The optimisation process is thus of iterative 
nature . 

The step 114 is representative of the calculation 
of the global error on the basis of the optimised 
5 parameters. In fact, the step 114 may be regarded as 
representative of the global error calculated m the 
final iteration of the optimisation process of the step 
112. 

in a comparison step 116 the global error xn 
0 question is compared with a fixed threshold adapted to 
be defined empirically. If the comparison test xs not 
passed (i.e. the global error is higher than the 
threshold), a substantial likelihood exists that the 
model used is not by itself a correct one: for instance 
L5 a model has been chosen that does not take into account 
the processing times at the web server, while a -fast" 
(i e a high throughput) network is considered, whereby 
the processing times in question must be accounted for 
in the model in order to obtain satisfactory results. 
20 m that case (negative outcome of the step 116) 

the system evolves via a return step 117 to the 
selection step 110, which leads to a new type of model 
being selected. 

The whole optimisation process" is thus repeated 

2 5 with the aim of obtaining a lower global error. 

A positive outcome of the test of step 116 
indicates that the global error obtained on the basis 
of the optimised model (e.g. in the case of the 
relationship I referred to in the foregoing, an 

3 0 optimised value for X giving the minimum global error) 

is acceptable. 

The system thus evolves to a step 118, whxch 
represents the beginning of the second phase 
represented by the flow-chart of figure 4. 
35 In such a second phase, the download performance 

(e.g. the download times and the efficiency indexes) is 
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evaluated for a selected set of pages (use pages) . This 
is done in a non- intrusive manner, by way of 
prediction, using the optimised model defined in the 
previous phase 

5 in a step 12 0 the data base items comprising the 

list of the selected pages or use pages is read from 
the database 26 and in a step 122, the site analyser 28 
is activated. The site analyser .28 is currently 
operated on a fast network in order to obtain m a 

10 short time statistics data related to a large number of 
pages . 

For each page in the set of the selected web 
pages, the analyser 28 determines the list and the 
dimensions of the respective objects to be determined. 
15 These items are memorised in the statistics database 
26. 

In a subsequent step 124 the predictor 30 is 
activated so that the total download time and the 
efficiency index are determined for each page in the 
20 list. The data concerning the pages are read from the 
statistics database 26, while the model /parameters to 
be used by the predictor are read from the respective 
databases 20 and 22. 

The results are stored in the prediction database 
25 32, and the system then evolves to a final step 126. 

The final result is an evaluation of the download 
times (and the efficiency indexes) for a selected 
number of pages among those accessible, and thus 
downloadable, via the network N. 
30 It will be appreciated that the download times 

(and the efficiency indexes) evaluation can be a useful 
tool both for 

- service providers in order to permit them to 
realise web pages having download times corresponding 
35 to the user requests; and 
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- network operators in order to permit them to 
know download times in a non- intrusive way. 

While the number is these pages may be very high, 
the download performance data can be evaluated by way 
of prediction in a short lapse of time. This takes 
place without interfering in any way with operation of 
the network N by using an optimised model defined on 
the basis of a set of sample pages including a 
relatively small number of sample pages (e.g. the home 
pages of the 100 most visited sites) that are 
statistically homogeneous with the pages whose download 
performance is to be evaluated. 

Of course, without prejudice to the underlying 
principle of the invention, the details and embodiment 
5 may vary, also significantly, with respect to what has 
been described by way of example only, wxthout 
departing from the scope of the invention as defined by 
the annexed claims. 



